Imitation Learning with Concurrent Actions in 3D Games

نویسندگان

Jack Harmer

Linus Gissl'en

Henrik Holst

Joakim Bergdahl

Tom Olsson

Kristoffer Sjoo

Magnus Nordin

چکیده

In this work we describe a novel deep reinforcement learning neural network architecture that allows multiple actions to be selected at every time-step. Multi-action policies allows complex behaviours to be learnt that are otherwise hard to achieve when using single action selection techniques. This work describes an algorithm that uses both imitation learning (IL) and temporal difference (TD) reinforcement learning (RL) to provide a 4x improvement in training time and 2.5x improvement in performance over single action selection TD RL. We demonstrate the capabilities of this network using a complex in-house 3D game. Mimicking the behavior of the expert teacher significantly improves world state exploration and allows the agents vision system to be trained more rapidly than TD RL alone. This initial training technique kick-starts TD learning and the agent quickly learns to surpass the capabilities of the expert.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Reductions for Imitation Learning

Imitation Learning, while applied successfully on many large real-world problems, is typically addressed as a standard supervised learning problem, where it is assumed the training and testing data are i.i.d.. This is not true in imitation learning as the learned policy influences the future test inputs (states) upon which it will be tested. We show that this leads to compounding errors and a r...

متن کامل

Is Bayesian Imitation Learning the Route to Believable Gamebots?

As it strives to imitate observably successful actions, imitation learning allows for a quick acquisition of proven behaviors. Recent work from psychology and robotics suggests that Bayesian probability theory provides a mathematical framework for imitation learning. In this paper, we investigate the use of Bayesian imitation learning in realizing more life-like computer game characters. Follow...

متن کامل

Born to Learn: What Infants Learn from Watching Us

Imitation is a powerful form of learning commonly used by children, adults and infants. A child's enthusiasm for imitative behavior prompts parental attention and interaction, and provides a mechanism for transmitting appropriate cultural and social behavior. Although simple imitative behavior is evident in the postnatal period, by around 14 months infants remember and repeat actions they obser...

متن کامل

Learning Strategies for Coordination of Multi Robot Systems: a Robot Soccer Application

This paper presents a hybrid method for learning a dynamic strategy for a robot soccer team. In this method, an imitation learning scheme based on observed robot soccer games is used as a seed for an experience-guided learning scheme based on reinforcement learning. A lack in the application of classic reinforcement learning to the robot soccer problem is the high number of states to be analyze...

متن کامل

Believability Testing and Bayesian Imitation in Interactive Computer Games

In imitation learning, agents are trained to carry out certain actions by examining a demonstration of the task at hand. Though common in robotics, little work has been done in translating these concepts to computer games. Given that present-day games generally use antiquated AI techniques which can often lead to stilted, mechanical and conspicuously artificial behaviour, it seems likely that a...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2018

Imitation Learning with Concurrent Actions in 3D Games

نویسندگان

چکیده

منابع مشابه

Efficient Reductions for Imitation Learning

Is Bayesian Imitation Learning the Route to Believable Gamebots?

Born to Learn: What Infants Learn from Watching Us

Learning Strategies for Coordination of Multi Robot Systems: a Robot Soccer Application

Believability Testing and Bayesian Imitation in Interactive Computer Games

عنوان ژورنال:

اشتراک گذاری